Effect of the prior distribution of SNP effects on the estimation of total breeding value
نویسندگان
چکیده
BACKGROUND Five main methods, commonly applied in genomic selection, were used to estimate the GEBV on the 15th QTLMAS workshop dataset: GBLUP, LASSO, Bayes A and two Bayes B type of methods (BBn and BBt). GBLUP is a mixed model approach where GEBV are obtained using a relationship matrix calculated from the SNP genotypes. The remaining methods are regression-based approaches where the SNP effects are first estimated and, then GEBV are calculated given the individuals' genotypes. METHODS The differences between the regression-based methods are in their prior distributions for the SNP effects. The prior distribution for LASSO is a Laplace distribution, for Bayes A is a scaled Student-t distribution, and the Bayes B type methods have a Spike and Slab prior where only a proportion (π) of SNP has an effect, following a given distribution. In this study, two different distributions were considered for the Bayes B type methods: (i) normal and (ii) scaled Student-t. They are referred here as the BBn and BBt methods, respectively. These prior distributions are defined by one or more parameters controlling their scale/rate (λ), shape (df) or proportion of SNP with effect (π). LASSO requires one (λ); two for Bayes A (λ, df) and Bayes Bn (λ, π); and three for Bayes Bt (λ, df, π). In this study, all parameters were estimated from the data. An extra scenario for Bayes A and BBt was included where df was not estimated but fixed to 4 (suffixed _4df). The implementation of GBLUP was done using ASREML, the heritability was also estimated from the data. All other methods were implemented using a MCMC approach. RESULTS All Bayes A and B methods showed accuracy (correlation between True and Estimated BV) as high as 0.94 except for BA_4df (r = 0.91). Compared to the traditional BLUP using pedigree information, these methods improved the accuracy between 50 and 55%. GBLUP and LASSO were less accurate (0.81 and 0.85 respectively) and the improvements were 34 and 40% compared to BLUP. CONCLUSIONS Results of all methods were consistent and the accuracies for GEBV ranged between 0.81 and 0.94. When all parameters were estimated the results were similar for the Bayes A and Bayes B methods. Results showed that Bayes A was more sensitive to the changes in the shape parameter, and the parameter changes led to change in the accuracy of GEBV. However BBt was more robust to the change in this parameter. This may be explained by the fact that BBt estimates one extra parameter and it can buffer against a non-proper shape parameter.
منابع مشابه
Effect of Markers Effect Estimation Methods, Population Structure and Trait Architercture on the Accuracy of Genomic Breeding Values
This study aimed to investigate the effect of the method of estimating the effects of markers , QTLs distribution, number of QTLs, effective population size and trait heritability on the accuracy of genomic predictions. Two effective population sizes, 100 and 500 individuals, were simulated by QMSim software. A 100 cM genome including one chromosome was simulated where 500 SNPs and two diffe...
متن کاملThe Effect of Uncoupling Protein Polymorphisms on Growth, Breeding Value of Growth and Reproductive Traits in the Fars Indigenous Chicken
The avianuncoupling protein (avUCP) is a member of the mitochondrial transporter superfamily that uncouples proton entry in the mitochondrial matrix from ATP synthesis. The polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP) method was used to estimate the allele and genotype frequencies of the UCP/HhaI polymorphisms and to determine associations between these polymorp...
متن کاملClassic and Bayes Shrinkage Estimation in Rayleigh Distribution Using a Point Guess Based on Censored Data
Introduction In classical methods of statistics, the parameter of interest is estimated based on a random sample using natural estimators such as maximum likelihood or unbiased estimators (sample information). In practice, the researcher has a prior information about the parameter in the form of a point guess value. Information in the guess value is called as nonsample information. Thomp...
متن کاملEstimation of parameter of proportion in Binomial Distribution Using Adjusted Prior Distribution
Historically, various methods were suggested for the estimation of Bernoulli and Binomial distributions parameter. One of the suggested methods is the Bayesian method, which is based on employing prior distribution. Their sound selection on parameter space play a crucial role in reducing posterior Bayesian estimator error. At times, large scale of the parametric changes on parameter space bring...
متن کاملEstimation of genomic breeding values using the Horseshoe prior
BACKGROUND A method for estimating genomic breeding values (GEBV) based on the Horseshoe prior was introduced and used on the analysis of the 16(th) QTLMAS workshop dataset, which resembles three milk production traits. The method was compared with five commonly used methods: Bayes A, Bayes B, Bayes C, Bayesian Lasso and GLUP. METHODS The main difference between the methods is the prior distr...
متن کاملارزیابی ژنومی صفات آستانه ای با معماری های ژنتیکی متفاوت با استفاده از روشهای بیزی
The current study was carried out to evaluate accuracy of some Bayesian methods for genomic breeding values prediction for threshold traits with different types of genetic architecture based on distribution of gene effect and QTL numbers. A genome consisted of 3 chromosomes of 100 CM with 2000 single nucleotide polymorphisms (SNP) was simulated. The QTL numbers were 0.01, 0.05 and 0.1 of total ...
متن کامل